Ontology Development for ETL Process Design
Abstract
Extract, Transform, Load (ETL) process design is difficult because user requirements are ambiguous and data integration and transformation are complex. Recent studies have explored ontology-based approaches to overcome these limitations by reconciling the semantics of user requirements within the ETL process design, which eases generation of the ETL process specification. In this work, an ontology for ETL process activities is developed using the Requirement Analysis Method for ETL Processes (RAMEPs), with requirements gathered from the perspectives of the organization, the decision-maker, and the developer. The ontology is then used to generate the ETL process specification for a student affairs Data Warehouse (DW) system. The correctness of the ontology model was validated using an appropriate reasoner. The ontology development process for the case study is presented, showing how the ontology-based approach succeeded in implementing the design and generating the ETL process specification.
Similar resources
Requirements Analysis Method for Extracting-Transformation-Loading (ETL) in Data Warehouse Systems
The data warehouse (DW) system design involves several tasks such as defining the DW schemas and the ETL processes specifications, and these have been extensively studied and practiced for many years. The problems in heterogeneous data integration are still far from being resolved due to the complexity of ETL processes and the fundamental problems of data conflicts in information sharing enviro...
Ontology-Driven Conceptual Design of ETL Processes Using Graph Transformations
One of the main tasks during the early steps of a data warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the source to the target data stores. This is a challenging task, requiring firstly the semantic and secondly the structural reconciliation of the information provided by the available sources. This task is a part o...
Flexible and Customizable NL Representation of Requirements for ETL Processes
The design of an Extract – Transform – Load (ETL) workflow for the population of a Data Warehouse is a complex and challenging procedure. In previous work, we have presented an ontology-based approach to facilitate the conceptual design of an ETL scenario. In this paper, we elaborate on this work, by investigating the application of Natural Language (NL) techniques to the ETL environment and we...
RAMEPs: A Goal-Ontology Approach to Analyse the Requirements for Data Warehouse Systems
The data warehouse (DW) systems design involves several tasks such as defining the DW schemas and the ETL processes specifications, and these have been extensively studied and practiced for many years. However, the problems in heterogeneous data integration are still far from being resolved due to the complexity of ETL processes and the fundamental problems of data conflicts in information shar...
A BPMN-Based Design and Maintenance Framework for ETL Processes
Business Intelligence (BI) applications require the design, implementation, and maintenance of processes that extract, transform, and load suitable data for analysis. The development of these processes (known as ETL) is an inherently complex problem that is typically costly and time consuming. In a previous work, we have proposed a vendor-independent language for reducing the design complexity ...